Pesquisa | Portal Regional da BVS

1.

Predicting and reasoning about replicability using structured groups.

Wintle, Bonnie C; Smith, Eden T; Bush, Martin; Mody, Fallon; Wilkinson, David P; Hanea, Anca M; Marcoci, Alexandru; Fraser, Hannah; Hemming, Victoria; Thorn, Felix Singleton; McBride, Marissa F; Gould, Elliot; Head, Andrew; Hamilton, Daniel G; Kambouris, Steven; Rumpff, Libby; Hoekstra, Rink; Burgman, Mark A; Fidler, Fiona.

R Soc Open Sci ; 10(6): 221553, 2023 Jun.

Artigo em Inglês | MEDLINE | ID: mdl-37293358

RESUMO

This paper explores judgements about the replicability of social and behavioural sciences research and what drives those judgements. Using a mixed methods approach, it draws on qualitative and quantitative data elicited from groups using a structured approach called the IDEA protocol ('investigate', 'discuss', 'estimate' and 'aggregate'). Five groups of five people with relevant domain expertise evaluated 25 research claims that were subject to at least one replication study. Participants assessed the probability that each of the 25 research claims would replicate (i.e. that a replication study would find a statistically significant result in the same direction as the original study) and described the reasoning behind those judgements. We quantitatively analysed possible correlates of predictive accuracy, including self-rated expertise and updating of judgements after feedback and discussion. We qualitatively analysed the reasoning data to explore the cues, heuristics and patterns of reasoning used by participants. Participants achieved 84% classification accuracy in predicting replicability. Those who engaged in a greater breadth of reasoning provided more accurate replicability judgements. Some reasons were more commonly invoked by more accurate participants, such as 'effect size' and 'reputation' (e.g. of the field of research). There was also some evidence of a relationship between statistical literacy and accuracy.

2.

Predicting reliability through structured expert elicitation with the repliCATS (Collaborative Assessments for Trustworthy Science) process.

Fraser, Hannah; Bush, Martin; Wintle, Bonnie C; Mody, Fallon; Smith, Eden T; Hanea, Anca M; Gould, Elliot; Hemming, Victoria; Hamilton, Daniel G; Rumpff, Libby; Wilkinson, David P; Pearson, Ross; Singleton Thorn, Felix; Ashton, Raquel; Willcox, Aaron; Gray, Charles T; Head, Andrew; Ross, Melissa; Groenewegen, Rebecca; Marcoci, Alexandru; Vercammen, Ans; Parker, Timothy H; Hoekstra, Rink; Nakagawa, Shinichi; Mandel, David R; van Ravenzwaaij, Don; McBride, Marissa; Sinnott, Richard O; Vesk, Peter; Burgman, Mark; Fidler, Fiona.

PLoS One ; 18(1): e0274429, 2023.

Artigo em Inglês | MEDLINE | ID: mdl-36701303

RESUMO

As replications of individual studies are resource intensive, techniques for predicting the replicability are required. We introduce the repliCATS (Collaborative Assessments for Trustworthy Science) process, a new method for eliciting expert predictions about the replicability of research. This process is a structured expert elicitation approach based on a modified Delphi technique applied to the evaluation of research claims in social and behavioural sciences. The utility of processes to predict replicability is their capacity to test scientific claims without the costs of full replication. Experimental data supports the validity of this process, with a validation study producing a classification accuracy of 84% and an Area Under the Curve of 0.94, meeting or exceeding the accuracy of other techniques used to predict replicability. The repliCATS process provides other benefits. It is highly scalable, able to be deployed for both rapid assessment of small numbers of claims, and assessment of high volumes of claims over an extended period through an online elicitation platform, having been used to assess 3000 research claims over an 18 month period. It is available to be implemented in a range of ways and we describe one such implementation. An important advantage of the repliCATS process is that it collects qualitative data that has the potential to provide insight in understanding the limits of generalizability of scientific claims. The primary limitation of the repliCATS process is its reliance on human-derived predictions with consequent costs in terms of participant fatigue although careful design can minimise these costs. The repliCATS process has potential applications in alternative peer review and in the allocation of effort for replication studies.

Assuntos

Ciências do Comportamento , Confiabilidade dos Dados , Humanos , Reprodutibilidade dos Testes , Custos e Análise de Custo , Revisão por Pares

3.

Are Experts Well-Calibrated? An Equivalence-Based Hypothesis Test.

Dharmarathne, Gayan; Hanea, Anca M; Robinson, Andrew.

Entropy (Basel) ; 24(6)2022 May 27.

Artigo em Inglês | MEDLINE | ID: mdl-35741478

RESUMO

Estimates based on expert judgements of quantities of interest are commonly used to supplement or replace measurements when the latter are too expensive or impossible to obtain. Such estimates are commonly accompanied by information about the uncertainty of the estimate, such as a credible interval. To be considered well-calibrated, an expert's credible intervals should cover the true (but unknown) values a certain percentage of time, equal to the percentage specified by the expert. To assess expert calibration, so-called calibration questions may be asked in an expert elicitation exercise; these are questions with known answers used to assess and compare experts' performance. An approach that is commonly applied to assess experts' performance by using these questions is to directly compare the stated percentage cover with the actual coverage. We show that this approach has statistical drawbacks when considered in a rigorous hypothesis testing framework. We generalize the test to an equivalence testing framework and discuss the properties of this new proposal. We show that comparisons made on even a modest number of calibration questions have poor power, which suggests that the formal testing of the calibration of experts in an experimental setting may be prohibitively expensive. We contextualise the theoretical findings with a couple of applications and discuss the implications of our findings.

4.

Bayesian networks for risk analysis and decision support.

Hanea, Anca M; Christophersen, Annemarie; Alday, Sandra.

Risk Anal ; 42(6): 1149-1154, 2022 06.

Artigo em Inglês | MEDLINE | ID: mdl-35770819

Assuntos

Técnicas de Apoio para a Decisão , Teorema de Bayes , Medição de Risco

5.

Reimagining peer review as an expert elicitation process.

Marcoci, Alexandru; Vercammen, Ans; Bush, Martin; Hamilton, Daniel G; Hanea, Anca; Hemming, Victoria; Wintle, Bonnie C; Burgman, Mark; Fidler, Fiona.

BMC Res Notes ; 15(1): 127, 2022 Apr 05.

Artigo em Inglês | MEDLINE | ID: mdl-35382867

RESUMO

Journal peer review regulates the flow of ideas through an academic discipline and thus has the power to shape what a research community knows, actively investigates, and recommends to policymakers and the wider public. We might assume that editors can identify the 'best' experts and rely on them for peer review. But decades of research on both expert decision-making and peer review suggests they cannot. In the absence of a clear criterion for demarcating reliable, insightful, and accurate expert assessors of research quality, the best safeguard against unwanted biases and uneven power distributions is to introduce greater transparency and structure into the process. This paper argues that peer review would therefore benefit from applying a series of evidence-based recommendations from the empirical literature on structured expert elicitation. We highlight individual and group characteristics that contribute to higher quality judgements, and elements of elicitation protocols that reduce bias, promote constructive discussion, and enable opinions to be objectively and transparently aggregated.

Assuntos

Revisão por Pares

6.

Co-designing and building an expert-elicited non-parametric Bayesian network model: demonstrating a methodology using a Bonamia Ostreae spread risk case study.

Hanea, Anca M; Hilton, Zoë; Knight, Ben; P Robinson, Andrew.

Risk Anal ; 42(6): 1235-1254, 2022 06.

Artigo em Inglês | MEDLINE | ID: mdl-35187670

RESUMO

The development and use of probabilistic models, particularly Bayesian networks (BN), to support risk-based decision making is well established. Striking an efficient balance between satisfying model complexity and ease of development requires continuous compromise. Codesign, wherein the structural content of the model is developed hand-in-hand with the experts who will be accountable for the parameter estimates, shows promise, as do so-called nonparametric Bayesian networks (NPBNs), which provide a light-touch approach to capturing complex relationships among nodes. We describe and demonstrate the process of codesigning, building, quantifying, and validating an NPBN model for emerging risks and the consequences of potential management decisions using structured expert judgment (SEJ). We develop a case study of the local spread of a marine pathogen, namely, Bonamia ostreae. The BN was developed through a series of semistructured workshops that incorporated extensive feedback from many experts. The model was then quantified with a combination of field and expert-elicited data. The IDEA protocol for SEJ was used in its hybrid (remote and face-to-face) form to elicit information about more than 100 parameters. This article focuses on the modeling and quantification process, the methodological challenges, and the way these were addressed.

Assuntos

Haplosporídios , Teorema de Bayes , Julgamento , Modelos Estatísticos

7.

Balancing the Elicitation Burden and the Richness of Expert Input When Quantifying Discrete Bayesian Networks.

Barons, Martine J; Mascaro, Steven; Hanea, Anca M.

Risk Anal ; 42(6): 1196-1234, 2022 06.

Artigo em Inglês | MEDLINE | ID: mdl-34146431

RESUMO

Structured expert judgment (SEJ) is a method for obtaining estimates of uncertain quantities from groups of experts in a structured way designed to minimize the pervasive cognitive frailties of unstructured approaches. When the number of quantities required is large, the burden on the groups of experts is heavy, and resource constraints may mean that eliciting all the quantities of interest is impossible. Partial elicitations can be complemented with imputation methods for the remaining, unelicited quantities. In the case where the quantities of interest are conditional probability distributions, the natural relationship between the quantities can be exploited to impute missing probabilities. Here we test the Bayesian intelligence interpolation method and its variations for Bayesian network conditional probability tables, called "InterBeta." We compare the various outputs of InterBeta on two cases where conditional probability tables were elicited from groups of experts. We show that interpolated values are in good agreement with experts' values and give guidance on how InterBeta could be used to good effect to reduce expert burden in SEJ exercises.

Assuntos

Julgamento , Projetos de Pesquisa , Teorema de Bayes , Prova Pericial , Probabilidade , Incerteza

8.

What is a Good Calibration Question?

Hemming, Victoria; Hanea, Anca M; Burgman, Mark A.

Risk Anal ; 42(2): 264-278, 2022 02.

Artigo em Inglês | MEDLINE | ID: mdl-33864272

RESUMO

Weighted aggregation of expert judgments based on their performance on calibration questions may improve mathematically aggregated judgments relative to equal weights. However, obtaining validated, relevant calibration questions can be difficult. If so, should analysts settle for equal weights? Or should they use calibration questions that are easier to obtain but less relevant? In this article, we examine what happens to the out-of-sample performance of weighted aggregations of the classical model (CM) compared to equal weighted aggregations when the set of calibration questions includes many so-called "irrelevant" questions, those that might ordinarily be considered to be outside the domain of the questions of interest. We find that performance weighted aggregations outperform equal weights on the combined CM score, but not on statistical accuracy (i.e., calibration). Importantly, there was no appreciable difference in performance when weights were developed on relevant versus irrelevant questions. Experts were unable to adapt their knowledge across vastly different domains, and in-sample validation did not accurately predict out-of-sample performance on irrelevant questions. We suggest that if relevant calibration questions cannot be found, then analysts should use equal weights, and draw on alternative techniques to improve judgments. Our study also indicates limits to the predictive accuracy of performance weighted aggregation, and the degree to which expertise can be adapted across domains. We note limitations in our study and urge further research into the effect of question type on the reliability of performance weighted aggregations.

Assuntos

Julgamento , Calibragem , Reprodutibilidade dos Testes

9.

Uncertainty Quantification with Experts: Present Status and Research Needs.

Hanea, Anca M; Hemming, Victoria; Nane, Gabriela F.

Risk Anal ; 42(2): 254-263, 2022 02.

Artigo em Inglês | MEDLINE | ID: mdl-33629402

RESUMO

Expert elicitation is deployed when data are absent or uninformative and critical decisions must be made. In designing an expert elicitation, most practitioners seek to achieve best practice while balancing practical constraints. The choices made influence the required time and effort investment, the quality of the elicited data, experts' engagement, the defensibility of results, and the acceptability of resulting decisions. This piece outlines some of the common choices practitioners encounter when designing and conducting an elicitation. We discuss the evidence supporting these decisions and identify research gaps. This will hopefully allow practitioners to better navigate the literature, and will inspire the expert judgment research community to conduct well powered, replicable experiments that properly address the research gaps identified.

Assuntos

Julgamento , Incerteza

10.

Predicting species and community responses to global change using structured expert judgement: An Australian mountain ecosystems case study.

Camac, James S; Umbers, Kate D L; Morgan, John W; Geange, Sonya R; Hanea, Anca; Slatyer, Rachel A; McDougall, Keith L; Venn, Susanna E; Vesk, Peter A; Hoffmann, Ary A; Nicotra, Adrienne B.

Glob Chang Biol ; 27(18): 4420-4434, 2021 Sep.

Artigo em Inglês | MEDLINE | ID: mdl-34117681

RESUMO

Conservation managers are under increasing pressure to make decisions about the allocation of finite resources to protect biodiversity under a changing climate. However, the impacts of climate and global change drivers on species are outpacing our capacity to collect the empirical data necessary to inform these decisions. This is particularly the case in the Australian Alps which have already undergone recent changes in climate and experienced more frequent large-scale bushfires. In lieu of empirical data, we use a structured expert elicitation method (the IDEA protocol) to estimate the change in abundance and distribution of nine vegetation groups and 89 Australian alpine and subalpine species by the year 2050. Experts predicted that most alpine vegetation communities would decline in extent by 2050; only woodlands and heathlands are predicted to increase in extent. Predicted species-level responses for alpine plants and animals were highly variable and uncertain. In general, alpine plants spanned the range of possible responses, with some expected to increase, decrease or not change in cover. By contrast, almost all animal species are predicted to decline or not change in abundance or elevation range; more species with water-centric life-cycles are expected to decline in abundance than other species. While long-term ecological data will always be the gold standard for informing the future of biodiversity, the method and outcomes outlined here provide a pragmatic and coherent basis upon which to start informing conservation policy and management in the face of rapid change and a paucity of data.

Assuntos

Mudança Climática , Ecossistema , Animais , Austrália , Biodiversidade , Plantas

11.

Weighting and aggregating expert ecological judgments.

Hemming, Victoria; Hanea, Anca M; Walshe, Terry; Burgman, Mark A.

Ecol Appl ; 30(4): e02075, 2020 06.

Artigo em Inglês | MEDLINE | ID: mdl-31971641

RESUMO

Performance weighted aggregation of expert judgments, using calibration questions, has been advocated to improve pooled quantitative judgments for ecological questions. However, there is little discussion or practical advice in the ecological literature regarding the application, advantages or challenges of performance weighting. In this paper we (1) illustrate how the IDEA protocol with four-step question format can be extended to include performance weighted aggregation from the Classical Model, and (2) explore the extent to which this extension improves pooled judgments for a range of performance measures. Our case study demonstrates that performance weights can improve judgments derived from the IDEA protocol with four-step question format. However, there is no a-priori guarantee of improvement. We conclude that the merits of the method lie in demonstrating that the final aggregation of judgments provides the best representation of uncertainty (i.e., validation), whether that be via equally weighted or performance weighted aggregation. Whether the time and effort entailed in performance weights can be justified is a matter for decision-makers. Our case study outlines the rationale, challenges, and benefits of performance weighted aggregations. It will help to inform decisions about the deployment of performance weighting and avoid common pitfalls in its application.

Assuntos

Ecologia , Julgamento , Incerteza

12.

Eliciting improved quantitative judgements using the IDEA protocol: A case study in natural resource management.

Hemming, Victoria; Walshe, Terry V; Hanea, Anca M; Fidler, Fiona; Burgman, Mark A.

PLoS One ; 13(6): e0198468, 2018.

Artigo em Inglês | MEDLINE | ID: mdl-29933407

RESUMO

INTRODUCTION: Natural resource management uses expert judgement to estimate facts that inform important decisions. Unfortunately, expert judgement is often derived by informal and largely untested protocols, despite evidence that the quality of judgements can be improved with structured approaches. We attribute the lack of uptake of structured protocols to the dearth of illustrative examples that demonstrate how they can be applied within pressing time and resource constraints, while also improving judgements. AIMS AND METHODS: In this paper, we demonstrate how the IDEA protocol for structured expert elicitation may be deployed to overcome operational challenges while improving the quality of judgements. The protocol was applied to the estimation of 14 future abiotic and biotic events on the Great Barrier Reef, Australia. Seventy-six participants with varying levels of expertise related to the Great Barrier Reef were recruited and allocated randomly to eight groups. Each participant provided their judgements using the four-step question format of the IDEA protocol ('Investigate', 'Discuss', 'Estimate', 'Aggregate') through remote elicitation. When the events were realised, the participant judgements were scored in terms of accuracy, calibration and informativeness. RESULTS AND CONCLUSIONS: The results demonstrate that the IDEA protocol provides a practical, cost-effective, and repeatable approach to the elicitation of quantitative estimates and uncertainty via remote elicitation. We emphasise that i) the aggregation of diverse individual judgements into pooled group judgments almost always outperformed individuals, and ii) use of a modified Delphi approach helped to remove linguistic ambiguity, and further improved individual and group judgements. Importantly, the protocol encourages review, critical appraisal and replication, each of which is required if judgements are to be used in place of data in a scientific context. The results add to the growing body of literature that demonstrates the merit of using structured elicitation protocols. We urge decision-makers and analysts to use insights and examples to improve the evidence base of expert judgement in natural resource management.

Assuntos

Tomada de Decisões , Austrália , Análise Custo-Benefício , Feminino , Humanos , Julgamento , Masculino , Recursos Naturais , Distribuição Aleatória

13.

The Value of Performance Weights and Discussion in Aggregated Expert Judgments.

Hanea, Anca M; McBride, Marissa F; Burgman, Mark A; Wintle, Bonnie C.

Risk Anal ; 38(9): 1781-1794, 2018 09.

Artigo em Inglês | MEDLINE | ID: mdl-29665625

RESUMO

In risky situations characterized by imminent decisions, scarce resources, and insufficient data, policymakers rely on experts to estimate model parameters and their associated uncertainties. Different elicitation and aggregation methods can vary substantially in their efficacy and robustness. While it is generally agreed that biases in expert judgments can be mitigated using structured elicitations involving groups rather than individuals, there is still some disagreement about how to best elicit and aggregate judgments. This mostly concerns the merits of using performance-based weighting schemes to combine judgments of different individuals (rather than assigning equal weights to individual experts), and the way that interaction between experts should be handled. This article aims to contribute to, and complement, the ongoing discussion on these topics.

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA